Bounded Regret

Building Technology to Drive AI Governance

4 months ago 7 min read

Technically skilled people who care about AI going well often ask me: how should I spend my time if I think AI governance is important? By governance, I mean the constraints, incentives, and

Oversight Assistants: Turning Compute into Understanding

5 months ago 10 min read

Currently, we primarily oversee AI with human supervision and human-run experiments, possibly augmented by off-the-shelf AI assistants like ChatGPT or Claude. At training time, we run RLHF, where humans (and/or chat assistants)

Analyzing long agent transcripts (Docent)

a year ago 1 min read

This is a brief overview of a recent release by Transluce. You can see the full write-up on the Transluce website. AI systems are increasingly being used as agents: scaffolded systems in which

Introducing Transluce — A Letter from the Founders

2 years ago 3 min read

We are launching an independent research lab that builds open, scalable technology for understanding AI systems and steering them in the public interest. Transluce means to shine light through something to reveal its

Augmenting Statistical Models with Natural Language Parameters

2 years ago 11 min read

This is a guest post by my student Ruiqi Zhong, who has some very exciting work defining new families of statistical models that can take natural language explanations as parameters. The motivation is

Updates and Lessons from AI Forecasting

Film Study for Research

Measurement, Optimization, and Take-off Speed

Advice for Authors

Building Technology to Drive AI Governance

Oversight Assistants: Turning Compute into Understanding

Analyzing long agent transcripts (Docent)

Introducing Transluce — A Letter from the Founders

Augmenting Statistical Models with Natural Language Parameters